o 






o 



X 



Lift-and-Project Integrality Gaps for the Traveling Salesperson 

Problem 

Thomas Watson* 
July 8, 2011 

Abstract 



We study the lift-and-project procedures of Lovasz-Schrijver and Sherali- Adams applied to 

the standard hnear programming relaxation of the traveling salesperson problem with triangle 

inequality. For the asymmetric TSP tour problem, Charikar, Goemans, and KarlofF (FOGS 

2004) proved that the integrality gap of the standard relaxation is at least 2. We prove that 

C/J ' after one round of the Lovasz-Schrijver or Sherali- Adams procedures, the integrality gap of the 

^^ . asymmetric TSP tour problem is at least 3/2, with a small caveat on which version of the 

^ ' standard relaxation is used. For the symmetric TSP tour problem, the integrality gap of the 

O . standard relaxation is known to be at least 4/3, and Gheung (SIOPT 2005) proved that it 

remains at least 4/3 after o{n) rounds of the Lovasz-Schrijver procedure, where n is the number 

of nodes. For the symmetric TSP path problem, the integrality gap of the standard relaxation 

^ ' is known to be at least 3/2, and we prove that it remains at least 3/2 after o{n) rounds of the 

'^ . Lovasz-Schrijver procedure, by a simple reduction to Cheung's result. 

(N 



1 Introduction 



The traveling salesperson problem (TSP) is the following: Given a complete directed graph with 
nonnegative edge distances satisfying the triangle inequality, the goal is to find a shortest route 
that visits all the nodes. The TSP is one of the most fundamental and well-studied problems in 
combinatorial optimization (with whole books devoted to it [LLRKS85| lABCCOTl [GP07J), and 
there are many variants. For example, in the tour version the goal is to find a shortest hamiltonian 
. . ! cycle, while in the path version we are additionally given two nodes s and t and the goal is to find 

a shortest hamiltonian path from s to t. Along a diff^erent dimension, in the symmetric version we 
assume that for every pair of nodes u and v the distance from u to f equals the distance from v to 
u, while in the asymmetric version we make no such assumption. 

The approximability of the TSP is not well-understood, and there are large gaps in the approx- 
imation ratios between the best NP-hardness results and the best approximation algorithms. For 
all four of the above variants, the best NP-hardness results are due to |P V06j : For every constant 
e > 0, it is NP-hard to approximate the symmetric tour and path versions within 220/219 — e, 
and it is NP-hard to approximate the asymmetric tour and path versions within 117/116 — e. The 
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state-of-the-art polynomial-time approximation algorithms achieve approximation ratios 3/2 for the 
symmetric tour version |Chr76] . 5/3 for the symmetric path version |Hoo91] . and 0(logn/loglogn) 
for the asymmetric tour and path versions AGM"*" 10| IFS07] where n is the number of nodes. There 



are better algorithms for several important restricted classes of distance functions. For example, 
there have been breakthroughs for shortest path metrics on unweighted undirected graphs: For 
the symmetric tour version, a (3/2 — eo)- approximation (for some constant eo > 0) is obtained in 
[OGSSIO] and a 1.461-approximation is obtained in [MSllj . and for the symmetric path version, a 
(5/3 — eo)-approximation (for some constant eo > 0) is obtained in [OGSSlOl IAS11| and a 1.586- 
approximation is obtained in |MS11| . The symmetric tour version can be approximated within l-|-e 
(for every constant e > 0) for Euclidean distances in constant-dimensional real space |Aro98] and 



for shortest path metrics on weighted undirected planar graphs [ AGK"'"98[ IKle08| . The asymmetric 



tour version has a 0(l)-approximation algorithm for shortest path metrics on weighted directed 
planar graphs [OGSllj . 

For problems such as TSP where the known NP-hardness lower bounds on approximation ratios 
are quite weak, a natural goal is to get stronger lower bounds for restricted classes of algorithms. 
Such lower bounds can sometimes be construed as evidence that the problem is indeed hard to 
approximate within the stronger bound, and they may have the additional advantage of being 
unconditional (NP-hardness lower bounds are conditional on P ^ NP). For many combinatorial 
optimization problems, one very general class of algorithms is those that solve some linear pro- 
gramming relaxation of the problem, "round" the solution to an integral solution in some way, and 
derive their approximation guarantee by comparing the value of the rounded solution to the optimal 
value of the relaxation. The integrality gap of an instance is the ratio between the optimal values 
of the original problem and the relaxation. The existence of a family of instances with integrality 
gap at least a > 1 proves that no such rounding algorithm can achieve approximation ratio better 
than a. Thus exhibiting instances with large integrality gaps constitutes an unconditional lower 
bound against this class of algorithms. The integrality gap of a relaxation for a problem is defined 
to be the maximum integrality gap over all instances (expressed as a function of instance size). 

For the above variants of the traveling salesperson problem, there is a classic and well-studied 
linear programming relaxation which has been called the Dantzig-Fulkerson-Johnson relaxation 
(after |DFJ54| ). the Held-Karp relaxation (after |HK701 IHK71J ). and the subtour elimination re- 
laxation. Since there is no consensus on what to call it, we simply refer to it as the standard 
relaxation. Many of the known approximation algorithms for TSP work by (implicitly or explic- 
itly) rounding solutions to the standard relaxation. For the symmetric tour version, the integrality 
gap is known to be at least 4/3. It is conjectured to be exactly 4/3, and proving this is a notorious 
open problem in combinatorial optimization. The best upper bound on the integrality gap is 3/2 
[WolSOj ISW90| , though in the case of shortest path metrics on unweighted undirected graphs the 
upper bound has been improved to 3/2 — eo (for some constant eo > 0) [OGSSlO] and further to 
1.461 [MSllj . For the symmetric path version, the integrality gap is known to be at least 3/2 and 
at most 5/3 |ASllj , though in the case of shortest path metrics on unweighted undirected graphs 
the upper bound has been improved to 5/3 — eo (for some constant eo > 0) [OGSSlOl lASllj and 
further to essentially 1.586 |MS11| . For the asymmetric tour version, the integrality gap is known 
to be at least 2 |CGK06| and at most 0(logn/loglogn) |AGM+10] . For the asymmetric path 
version, the integrality gap is known to be at least 2 JCGK06] and at most O(logn) |FSS1 0]. 

Although integrality gap lower bounds rule out algorithms that derive their approximation 
guarantees by comparing the output to the optimum of a particular relaxation of the problem, 



one can always construct tighter and tighter relaxations in hope of reducing the integrality gap 
and getting improved approximation guarantees. Hence it is desirable to prove lower bounds 
for large classes of relaxations, since this constitutes stronger evidence for the hardness of the 
problem. Lovasz and Schrijver JLS91J and Sherali and Adams |SA90j introduced powerful and 
general techniques for constructing hierarchies of tighter and tighter relaxations for any problem 
(see |Tulll| for a survey). These techniques are called "lift-and-project" procedures, and there has 
been substantial interest in proving integrality gap lower bounds for various relaxations produced by 
these procedures (a small sample of results includes |AATn5l[STTn7ll(;;MPTinll(MMn9[l(;;MTn90 . 
A nice feature of such results is that lower bounds for sufficiently many rounds of the procedure 
allow us to rule out slightly-subexponential time rounding algorithms, something that PCP-based 
results often fail to do. 

We consider the task of proving integrality gap lower bounds for lift-and-project procedures 
applied to the standard relaxation of the TSP. There is only one previous paper on this topic: 
Cheung [CheOSj proved an essentially tight result for the Lovasz- Schrijver procedure applied to the 
symmetric TSP tour problem — he showed that the integrality gap remains at least 4/3 after o{n) 
rounds of the procedure for n-node instances. A natural question is whether the integrality gap 
lower bound of 2 for the asymmetric TSP tour problem [CGK06J survives these lift-and-project 
procedures. Our main theorem is a positive result in this direction: We prove that the integrality 
gap is at least 3/2 after one round of the Lovasz- Schrijver procedure (which is equivalent to one 
round of the Sherali- Adams procedure). There is a small caveat though: There are two versions 
of the standard relaxation which are both widely used in the literature and which are equivalent 
in a certain sense, but one seems to become somewhat weaker than the other after lift-and-project 
procedures are applied. We can only prove our lower bound for the weaker version (see Section 11.21 
for details). 

One key challenge in proving Lovasz-Schrijver integrality gap lower bounds is in designing so- 
called protection matrices. None of the previous techniques for designing protection matrices (for 
problems such as vertex cover, constraint satisfaction problems, and symmetric TSP) seem to help 
with asymmetric TSP. We introduce a new technique for our setting, based on finding certain 
combinatorial objects we call frames in directed graphs. 

Another natural question is whether the folklore integrality gap lower bound of 3/2 for the 
symmetric TSP path problem survives lift-and-project procedures. We show that it does, in fact, 
survive o{n) rounds of the Lovasz-Schrijver procedure, by giving a simple reduction to Cheung's 
result |Che05] . The reduction is not generic, but rather exploits specific properties of Cheung's 
family of instances. This result can be considered evidence that the symmetric TSP path problem 
cannot be approximated better than 3/2 (even by slightly-subexponential time algorithms). The 
fact that we get a lower bound of 3/2 in both this result and our result for the asymmetric TSP 
tour problem is a coincidence; the techniques used are quite different. 

1.1 Definitions 

In Section n. 1.11 we give the definitions of integrality gaps and the Lovasz-Schrijver lift-and-project 
procedure. In Section 11.1.21 we give the definitions of the relevant variants of the traveling sales- 
person problem and their standard linear programming relaxations. 



1.1.1 Lift-and-Project Integrality Gaps 

For our purpose a (combinatorial optimization) problem is a set of instances, where each instance 
has a set of points I C {0, 1}*" and a nonnegative convex objective function /, and the goal is to 
minimize / over points in /. In the notation, we suppress the dependence of /, m, and / on the 
instance. The instances of a problem may have some size parameter n (not necessarily equal to m). 
A relaxation of a problem associates to each instance a convex set R C [0, 1]™ such that I Q R. 
Given a relaxation of a problem, the integrality gap of an instance is defined to be ^j^^^fTji) ^^^ 
the integrality gap of the relaxation is defined to be the maximum integrality gap of an instance, 
expressed as a function of the instance size n. If the objective function for a problem is always 
linear, and the set R for a relaxation is always a polytope, then we call the relaxation a linear 
programming relaxation. 

Lovasz and Schrijver [LS91J introduced a general technique for improving linear programming 
relaxations. Given a convex set R C [0, 1]™, the Lovasz-Schrijver procedure produces a new convex 
set N{R) C [0, 1]"* which is a polytope if R is. The procedure can be applied iteratively, and we use 
N^{R) to denote the convex set after r rounds (for example N'^{R) = N{N(R))). The A^ operator 
has the following properties: R ^ N{R) D conv(i? n {0,1}"^) and N'^{R) = conv(i? n {0, 1}™) 
where conv(-) denotes convex hull, and thus 

R 5 N{R) D N^{R) 5 ••• 5 N"'^\R) D iV™(i?) = conv(i? n {0, 1}™). 

Hence the procedure can be applied to a relaxation of a combinatorial optimization problem to 
yield new relaxations, where the integrality gap is nonincreasing with the number of rounds and 
drops to 1 after at most m rounds if RCi {0, l}*" = /. Furthermore, if R has a polynomial-time 
separation oracle, then linear objectives can be optimized over N^{R) in time "mP^"^' . 
We now define the A^ operator. Given a convex set R C [0, 1]™ we define 

cone(i?) = {(A, Axi, . . . , Axm) : A > and (xi, . . . ,Xm) G -RJ 

and we index the A coordinate by 0. For an [m + 1) x [m + 1) matrix X we index the top row 
and left column by 0, and we use Xi to denote the iih. row (i G {0, 1, . . . , m}). Given a convex set 
R C [0, 1]™", a point x G [0, 1]"* is in N[R) if and only if there exists an {m + 1) x [m + 1) matrix 
X, called a protection matrix^ such that 

(i) X is symmetric, 
(ii) Xq = diag(X) = (1 x), and 

(iii) Xi € cone(i?) and Xq — Xj € cone(i?) for each z € {1, . . . , m}. 

The procedure is called a "lift-and-project" procedure because it first lifts i? to a convex set in 
[0, 1](™+^) (namely the set of matrices satisfying (i), (ii), and (iii) for some x) which is a polytope 
if R is, and then projects on certain coordinates to get N{R). Lovasz and Schrijver also introduced 
a stronger operator A^_|_ which is defined in the same way as N except the protection matrix is also 
required to be positive semidefinite. For all convex sets R C [0, 1]™ we have N^{R) C N{R) and 

R D N+{R) 5 ••• 5 N':;j{R) = conv(i?n{0,l}'"). 

Sherali and Adams |SA90) introduced a lift-and-project procedure for improving linear pro- 
gramming relaxations which is more powerful than the Lovasz-Schrijver procedure, but also more 
difficult to prove integrality gap lower bounds for. One round of Sherali-Adams coincides with one 
round of Lovasz-Schrijver. We do not define the procedure for higher rounds here. 



1.1.2 The Traveling Salesperson Problem 

We now formally define the traveling salesperson problem and its standard linear programming 
relaxation |DFJ541 IHK701 IHK71J . For completeness, we define all four variants discussed above. 
Given an undirected graph G = {V, E) and a set 5 C 1/ we let 5{S) denote the set of edges crossing 
the cut (5, S). For f € ^ we define 5{v) to be 5{{v}). Given a directed graph G = (F, E) and a set 
S" C y we let 5^{S) denote the set of edges leaving S and 6^{S) denote the set of edges entering 
S. For t) € y we define 5^{v) to be 5'^{{v}) and 5~{v) to be 5^{{v}). For every graph G = (F, E) 
(undirected or directed), given a set F C £) and a vector x = {xe)eeE we define x{F) = X^eeF^e- 
Also, given two vectors d = {de)eeE and x = {xe)eeE we define d-x = Yle&E deXe- We say a directed 
graph is complete if every pair of nodes has both edges between them and there are no self loops. 
We use Kn to denote both the complete undirected graph on n nodes and the complete directed 
graph on n nodes; it will always be clear from context which is meant. 

Symmetric tour version. Given an undirected graph G = {V,E), the symmetric tour polytope 
ST{G) has variables x = {xe)e&E and is defined by the following constraints. 

x{5{S)) > 2 V0C5cy 

x{5{v)) = 2 V V G y 

Xe G [0, 1] V e G £■ 

We define /S'r™*(G) similarly but require Xe G {0, 1} for all e G E; note that 5T™*(G) consists of 
exactly the hamiltonian cycles in G. The symmetric TSP tour problem is the following: Given a 
complete undirected graph Kn = iV-iE) with nonnegative edge distances d = {d(,)e<^E satisfying 
the triangle inequality, minimize d ■ x subject to x G ST^'^^{Kn). The standard relaxation allows 
X G ST{Kn). 

Symmetric path version. Given an undirected graph G = {V,E) and distinct nodes s,t & V 
(the case s = t is covered by the symmetric tour version), the symmetric path polytope SP{G,s,t) 
has variables x = (xe)eg_E and is defined by the following constraints. 

xi6{S)) > 2 V0CScy with |5n{s,t}| /I 

x{6{S)) > 1 V0CScy with |5n{s,t}| =1 

x{5{v)) = 2 y veV\{s,t} 

x{S{v)) = 1 VvG{s,t} 

Xe G [0, 1] y eeE 

We define SP'"^\G,s,t) similarly but require Xe G {0,1} for ah e G E; note that SP'"^\G,s,t) 
consists of exactly the hamiltonian paths in G with s and t as their endpoints. The symmetric TSP 
path problem is the following: Given a complete undirected graph Kn = {V, E) with nonnegative 
edge distances d = {de)e&E satisfying the triangle inequality, and given distinct nodes s,t G V, 
minimize d ■ x subject to x G SP^^^{Kn, s, t). The standard relaxation allows x G SP{Kn, s, t). 



Asymmetric tour version. Given a directed graph G = {V^E), the asymmetric tour polytope 
AT{G) has variables x = {xe)e€E and is defined by the following constraints. 

x{6+{S)) > 1 V0C5cy 

xi6'{S)) > 1 V0C5cy 

x{6+{v)) = 1 y veV 

x{6~{v)) = 1 \/ veV 

Xe G [0, 1] y e£ E 

Of course, the second group of constraints is redundant given the first, but we prefer to include 
it for aesthetic reasons. We define AT™*(G) similarly but require Xg G {0,1} for all e £ E; note 
that ^r™*(G) consists of exactly the hamiltonian cycles in G. The asymmetric TSP tour problem 
is the following: Given a complete directed graph K^ = {V, E) with nonnegative edge distances 
d = {de)e&E satisfying the triangle inequality, minimize d ■ x subject to x G AT™*(K„). The 
standard relaxation allows x G AT{Kn)- 

Asymmetric path version. Given a directed graph G = {V, E) and distinct nodes s, i G V (the 
case s = t is covered by the asymmetric tour version), the asymmetric path polytope AP{G,s,t) 
has variables x = {xe)eeE and is defined by the following constraints. 

x{6+{S)) > 1 V0C 5CF\{t} 

x{5'{S)) > 1 V0C5C1/\{s} 

x{6+{v)) = 1 VwG V\{t} 

x{6-{v)) = 1 yv£V\{s} 

x{5^{t)) = 

x{d-{s)) = 

xe G [0, 1] V e G ^ 



The importance of the constraints x{6^{t)) = x{6~{s)) = was clarified in Nag08| . We define 
AP"*(G, s,i) similarly but require Xg G {0,1} for all e e E; note that AP'°*(G, s,i) consists 
of exactly the hamiltonian paths in G from s to t. The asymmetric TSP path problem is the 
following: Given a complete directed graph Kn = (V, E) with nonnegative edge distances d = 
[de)e<^E satisfying the triangle inequality, and given distinct nodes s,t G F, minimize d ■ x subject 
to X G 74P'"*(/sr„,s,t). The standard relaxation allows x G AP{Kn,.,s^t). 

Although the cut constraints in the four polytopes ST^ SP, AT, AP look rather different from 
each other, they are actually very uniform: Every cut should have capacity at least 2 (symmetric 
versions) or at least 1 in both directions (asymmetric versions), unless the cut separates s and t 
in the path versions, in which case it should have capacity at least 1 (symmetric version) or at 
least 1 from s to i (asymmetric version). Note that although all four polytopes have exponentially 
many constraints, they each have a polynomial-time separation oracle using min-cut computations. 
Hence linear objectives can be optimized over these polytopes in polynomial time using the ellipsoid 
algorithm. 



1.2 Results 

There is only one previous result on lift-and-project integrality gaps for the traveling salesperson 
problem: Cheung |Che05) proved that the integrality gap of the standard relaxation of the symmet- 
ric TSP tour problem remains at least 4/3 — o(l) even after o(n) rounds of the Lovasz-Schrijver A^_|_ 
procedure. We consider whether a lower bound better than 4/3 can be obtained for the asymmetric 
TSP tour problem. Although we cannot prove a better lower bound for the standard relaxation, we 
can prove a new result for the following relaxation, which is equivalent to the standard relaxation 
in a certain sense. 

Given a directed graph G = {V,E), the balanced asymmetric tour polytope ATbai(G) has vari- 
ables X = (xe)ee-E and is defined by the following constraints. 

x((5+(5)) > 1 V0c5cy 

x{S^{S)) > 1 V0C5cy 

x{6+{v)) = x{6-{v)) y veV 

Xe € [0, 1] V e e S 

As before, the second group of constraints is redundant given the first, but we prefer to include it 
for aesthetic reasons. We define AT^^{G) similarly but require Xe G {0, 1} for all e G E. 

We have AT{G) C ATbai(G) and thus the balanced asymmetric tour polytope yields a relaxation 
for the asymmetric TSP tour problem, which we call the balanced standard relaxation. It is well- 
known that ^Tbai(-fi^n) is equivalent to AT{Kn), and AT^^^^Kn) is equivalent to ^T™*(Er„), in 
the sense that they have the same minimum values under any objective d ■ x where d satisfies 
the triangle inequality (see for example |CGK06l Ngu08| ). For this reason, both formulations are 



very commonly used in the literature and are considered interchangeable. However, they are not 
necessarily interchangeable after lift-and-project procedures are applied; N{AT{Kn)) might have a 
smaller integrality gap than N {ATba_i{Kn)) ■ The integrality gap of ^Tbai(-ft^n) does drop to 1 after 
at most m = n{n — 1) rounds of N, due to the equivalence of AT^^{Kn) and AT™*(i^„). 

Theorem 1. The integrality gap of one round of the Lovasz-Schrijver / Sherali-Adams procedure 
applied to the balanced standard relaxation of the asymmetric TSP tour problem is at least 3/2— o(l). 

We prove Theorem [1] in Section [2j Note that the lower bound of 3/2 — o(l) beats the lower 
bound of 4/3 — o(l) that follows trivially from Cheung's result for the symmetric TSP tour problem. 
There are four deficiencies in Theorem [1] that it would be nice to overcome. 

(1) We only handle a single round of lift-and-project; ideally we would like to handle at least a 
superconstant number of rounds. 

(2) We only handle the balanced standard relaxation; ideally we would like to handle the standard 
relaxation as defined in Section [1.1.21 

(3) The integrality gap lower bound is only 3/2 — o(l); ideally we would like to match the lower 
bound of 2 — o(l) due to |CGK06] which holds for zero rounds of lift-and-project. 

(4) The lower bound only holds for the Lovasz-Schrijver A^ procedure; ideally we would like to 
handle the stronger N^ procedure. 



The instances we use to witness Theorem [T] are the same as the ones constructed in |CGK06J . 
but we need a new analysis showing that a certain fractional solution survives one round of lift- 
and-project. The heart of the analysis involves finding certain sets of edges, which we call frames, 
in the graphs constructed in |CGK06j . We need to find a frame associated with each edge, with 
the property that for every pair of edges ei and 62, 62 is in ei's frame if and only if ei is in 62 's 
frame (this property is related to the symmetry requirement for protection matrices). 

We now turn to the symmetric TSP path problem. By giving a simple reduction to Cheung's 
result |Che05] . we show that the folklore 3/2 — o(l) lower bound on the integrality gap of the 
standard relaxation survives o{n) rounds of the Lovasz-Schrijver A^+ procedure. The reduction is 
not a generic reduction, but rather exploits special properties of Cheung's family of instances. 

Theorem 2. The integrality gap of o{n) rounds of the Lovdsz-Schrijver A^+ procedure applied to 
the standard relaxation of the symmetric TSP path problem is at least 3/2 — o(l). Moreover, this 
lower bound holds even for instances that are shortest path metrics on unweighted undirected graphs. 



We prove Theorem [2] in Section [3l 



2 Asymmetric TSP Tour Problem 

In this section we prove Theorem [TJ For every e > we need to construct an instance d = {de)e&E 
on Kn = iy, E) such that 

min„p /iT^intcR' \d ■ X 

""^^^ ^^"> > 3/2 -e. (1) 

mm^.g;v(ATb,i(^n)) d ■ X 

The instances we use are the same ones constructed by Charikar, Goemans, and Karloff [CGK06] . 
For integers k > 1 and r > 2 they construct a directed graph Gk^r with nonnegative edge weights 
as follows (see the illustrations in Figures [H [2l and [3]). The graph Gi^r consists of two paths of 
r + 1 edges on the same r + 2 nodes, going in opposite directions, and all edges have weight 1. One 
of the two endpoints is designated as the source s and the other as the sink t. For k > 1 the graph 
Gk,r consists of r copies of Gk-i,r and an additional source node s and sink node t, with a path 
from s to t of r + 1 edges visiting the sources of the copies in some order, and another path from 
t to s of r + 1 edges visiting the sinks of the copies in the opposite order. All the new edges have 
weight r . 

Next, as in |CGK06| we define a directed graph L^ ,. = iVk^r^ -E^fc.r) with nonnegative edge weights 
as follows. Suppose (s, vi), {v2,t), (t, ^3), (^4, s) are the edges incident to s and t in Gk,r- Then Lk^r 
is defined by taking G^^r, removing s and t, and including edges (^2, "Wi) and (^4,^3), both of weight 

Now fix some k > 1 and r > 2 and let n = \Vk^r\ = &{r^)- We define the edge distances 
d = (de)eeE on the complete directed graph Kn = (yk^r,E) as the shortest path distances in the 
weighted graph L^, ,.. Note that d is nonnegative and satisfies the triangle inequality, so this is a 
valid instance. 



Lemma 1 ( [CGKd6] ). For k >2 and r > 3 we have min^g^^mt^^^) d ■ x > (2k — l)(r — l)r 
Lemma 2. For k > 1 and r >2 we have min^g^v^^'pj^^^^^^-)) d ■ x < ^k{r + l)r^~^. 



fc-i 




Figure 2: The graph G- 



2,3- 



Letting k > 2 and r > 3 and combining Lemma [T] and Lemma [21 we find that the ratio on the 
left side of ([T|) is at least | • ~Jr+i)' ' which is at least 3/2 — e provided A; and r are large enough. 
It remains to prove Lemma [2l 



Proof of Lemma\^ Consider the vector x = {xe)eeEkr with Xg = 2/3 for all e G -Efc.r- We claim 
that X S A^(^Tbai(-^fc,r))- Then a simple argument shows that extending x with values for all the 
edges in E\E].^r yields a point x G A^(^rt,ai(i^n)) such that d ■ x equals 2/3 times the sum of all 
the edge weights in L^^r (using the fact that every edge in the weighted graph Lk^r is a shortest 
path from its tail to its head). An inductive argument shows that the sum of all the edge weights 
in Lfc^r is at most 2k{r + \)r^~^ (see |CGK06j ). so x witnesses LemmaO 

All that remains is to prove that x G iV(ATbai(-^A:.r))) aud this is the meat of the argument. In 
order to exhibit a protection matrix for x, we first need the following definitions. We call a path 
or cycle in a directed graph edge-simple if it does not repeat any edges, but may repeat nodes. 

Definition 1. Given a directed graph G' = (y',E') and an edge {u,v) £ E' , a frame for {u,v) is 
a set of edges F C E'\{(u,v)} that consists of an edge-simple path from u to v together with zero 
or more edge-simple cycles. The path and cycles are required to be edge-disjoint from each other. 

Claim 1. In the graph L^^r there exists a frame Fe for each e € -Efc.r; with the property that for all 
ei, 62 € Ek^r we have 62 S F^-^ if and only if ei G Fgj . 

We prove Claim [T] shortly, but let us now see how to use it to construct a protection matrix X 
for X. The rows and columns of X are indexed by E^^r, except there is an additional 0th row and 
0th column. We must have -'^o.o = 1 and Xq^^ = Xe,o = Xe,e = 2/3 for all e € -Efc.r- For ei 7^ 62 we 
set 

11/3 ife2GFe, 

1 1/2 otherwise 

Note that X is symmetric by the property in Claim [H Thus we just need to show that X^ G 
cone(ATbai(-^A:,r)) and Xq - Xe £ cone(Arbai(-^fc,r)) for ah e G E^^r- 



X. 



61,62 
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Figure 3: The graph G; 
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We first consider Xe- Since {Xe)o = 2/3, Xe € cone(j4rbai(iA:,r)) is equivalent to x^'^' G 
ATbi^\{Lk^r) where x^'^^ is the vector on E^^r that gives value 1 to e, value 1/2 to the edges in Fg, 
and value 3/4 to the remaining edges. To verify the balance condition x^'^' (6~^ {v)) = x^'^^((5~(v)) for 
all V G Vk,r, consider starting with the vector that assigns 3/4 to all edges. Since every node in L^^r 
has two incoming edges and two outgoing edges, this vector would satisfy the balance condition. 
Reducing the values on the path and cycles to 1/2 maintains the balance condition except at the 
endpoints of e, which are remedied by raising the value of e to 1. To verify the cut constraints, 
suppose we interpret x^'^' as capacities in a flow network. Then we need to show that every cut 
has capacity at least 1 in both directions, which is equivalent to showing that a unit of flow can 
be sent from any node to any other node. It is shown in [CGK06J that for every n, u G Vk^r there 
exist two edge-disjoint paths from n to w in L^^r- Since every edge has capacity at least 1/2 in 
x^'^' , we can send a half-unit of flow along each of these two paths. Thus we have shown that 

Xe G COne(^Tbal(-Z^fc,r))- 

Now we consider Xq — X^- Since {Xq — Xe)o = 1/3, Xq — Xg G cone(j4Tbai(-^fe,r)) is equivalent 
to y^^> G AT]^^i{Lk^j.) where y*-"^) is the vector on E^^r that gives value to e, value 1 to the edges 
in Eg, and value 1/2 to the remaining edges. The balance condition is verified similarly as before: 
The vector that assigns 1/2 to all edges would satisfy the balance condition, and raising the values 
on the path and cycles to 1 maintains the balance condition except at the endpoints of e, which 
are remedied by reducing the value of e to 0. To verify the cut constraints, we need to show that 
a unit of flow can be sent from any node to any other node, respecting the capacities y^^' . Every 
edge has capacity at least 1/2 except for e itself, so if we try to send a half-unit of flow along each 
of the two edge-disjoint paths from u to v, we only run into trouble if one of these paths uses e. 
In that case, the half- unit of flow through e can be re-routed via the path in the frame E^. This 
path may overlap with the two paths from u to v, but since each edge in Ef> has capacity 1, it can 
accommodate two half-units of flow simultaneously. Thus we can always send a unit of flow from 
u to V, and we have shown that Xq — Xg G cone(^Tbai(ifc,r))- 

To flnish the proof of Lemma O we just need to prove Claim [TJ For each edge e G E^^r there is 
a unique shortest path (in terms of number of edges) from e's tail to e's head that does not use e. 
The natural first attempt at defining the frames is to take Eg to be this path for each e. However, 
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Figure 4: Frame for a nonmediating inner edge at level 3 < i < k — 2. 
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Figure 5: Frame for a mediating outer edge at level 1 = 2. 

this does not work in general; it is crucial that we augment the path with some cycles. 

Each edge e G E^^^y. can be assigned a level £ G {1, . . . , /c}, meaning that e is introduced as part 
of a copy of Gi^r in the construction of G^^r (and the two new edges in Lfc^^. are at level k\ The 
most important distinction is whether i < k ov i = k. If i < k then we include in Fe the shortest 
path from e's tail to e's head that does not use e, and for every copy of Ge^r that is contained in 
the ambient copy of G^+i^j- (or in Lk^r ii I = k — 1) and that is not touched by this path, we include 
in Fe a cycle containing all the edges at level i in this copy of G^^r- ii i = k then we only include 
in Fe the shortest path from e's tail to e's head that does not use e, and we include no cycles. This 
completes the description of the frames. 

Let us be a bit more precise and give some illustrations. From now on, we use "terminal" to 
mean either source or sink, since there is no need to distinguish between the latter. We say an 
edge at level i < k is "mediating" if it is incident to a terminal when it is introduced in G^^^, and 
"nonmediating" otherwise. Also, we say an edge at level i < k — 1 is "outer" if when we start at its 
head and follow the direct route to a terminal of the ambient copy of Gi^r i the other edge entering 
that terminal is a mediating edge. The edge is an "inner" edge otherwise, and all edges at level 
k — 1 are considered inner. 

Consider an edge e = {u, v) at level I < k. There are four cases depending on whether e is 
mediating/nonmediating and outer /inner. The "typical" case is when e is nonmediating and inner 
and at level 3 < I < k — 2. This is illustrated in Figure [H with bold frame edges. Then u is a 
terminal of a copy of Ge-i^r, and the path starts by taking the direct route to the other terminal of 
this copy, then the direct route to a terminal of the ambient copy of G^^^i then one step to the next 
copy of Gi^r, then the direct route to the other terminal of that copy, then one step back to the 
copy of Gi^r containing e, then the direct route to the other terminal of the copy of Gi^i^r of which 
w is a terminal, and finally the direct route to v through this copy of Gi-i^r- This path involves two 
copies of Gi^r- For each of the other r — 2 copies of G^r in the ambient copy of G^+i^^; we include a 
cycle containing all the edges at level i in this copy of Gg^r- This completes the description of the 
frame Fe. Nothing changes if i = 2 or i = k — 1. When i = 1, there are no copies of "G^-i,r" to go 
through, so the path simply starts by going directly to a terminal of the copy of Gi^r containing e, 
and ends by going from the other terminal directly to v. 
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Figure 6: Frame for an edge at level i = k. 

When e is mediating, all that changes is that one of the two copies of G^_i,r- disappears; either 
li or u is already a terminal of the ambient copy of Gg^r- When e is outer, there is no "next" copy 
of Ge^r] the first step along an edge at level i + 1 takes us to a terminal of the ambient copy of 
Gi-^-l^r, then the next step takes us back to the copy of Gg^r containing e. Furthermore when e is 
outer, r — 1 cycles need to be included in the frame (rather than r — 2) since the path only touches 
one copy of Gg^r- Figure [5] illustrates the frame for a mediating outer edge at level i = 2. 

Figure E] illustrates the frame for an edge e = {u,v) at level £ = k. The path takes the direct 
route from u to the other terminal of the copy of Gk-i^r containing n, then follows the level-fe cycle 
containing this terminal until it gets to the copy of Gk-i,r containing v, then follows the direct 
route to V through this copy. This completes the more precise description of all the frames. 

It remains to be verified that for all ei, 62 € E^^j. we have 62 G -Fei if and only if ei G -Fe2- Of 
course, we just need to consider an arbitrary ei and show that for each 62 G Fei we have ei G F^,^. 
This is a slightly tedious case analysis. Rather than give full details (which would be pedantic) we 
give a couple illustrative cases. Suppose ei is the edge e in Figure HI If 62 is the third edge on the 
path in Fg^, then the path in Fgj first goes right to the other terminal in the copy of Gg^2,ri then 
goes down to ei's tail, then traverses ei, then goes up through the copy of G^-i,r containing ei's 
head, and so on. If 62 is the edge that appears right below ei on the path in F^^ , then the path in 
-Fe2 only touches the bottom two copies of Gg^r in the figure, and ei is contained in a cycle in F;: 



£2 



(the one for the copy of Gg^^ containing ei). This demonstrates the importance of including cycles 
in the frames. The other cases can be checked similarly. 

This finishes the proof of Claim [1] and the proof of Lemma [2j D 

3 Symmetric TSP Path Problem 

In this section we prove Theorem [2j For every e > we need to construct an instance d = {de)e€E 
on Kfi = (y, E) with s,t ^V such that for some r = jn with 7 > depending on e, 
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xeby (K„,s,t) ^ 3/2 -e. (2) 



T^'^'^xeNi(SPiK„,s,t)) d ■ X 



For integers l >1 and g > we construct an undirected graph G^^q = (yi^q,Ei^q) as follows (see 
the illustration in Figure [7]). The graph Gi^q has two horizontal node-disjoint paths each with i 
edges. It also has two cliques each with 3q + 3 nodes, a left one and a right one. The cliques are 
node-disjoint from each other and from the paths. The two left endpoints of the paths each have 
edges to all the nodes in the left clique, and the two right endpoints of the paths each have edges to 
all the nodes in the right clique. Finally, there is an additional node s with edges to all the nodes 
in the left clique, and an additional node t with edges to all the nodes in the right clique. 

Also consider the graph Gi = iV£„,Ei) (illustrated in Figure [S]) which is the same as Gi^q 
except that s and t are connected by a path with i new edges (using £ — 1 new nodes). We have 
Ve,q C Vlg and E,^q C E^^. 

Lemma 3. For all r > 0, i > 1, and q > the following holds. For all x = {xe)e&Ei , if x' = 
(Xg)eg_B' is the same as x but with all new edges e in Ei having Xg = 1, then x' G N'^(ST{G'^ )) 
implies x G N^{SP{G£^q,s,t)). 

Proof. We prove the lemma by induction on r. Suppose r = 0. Trivially, x'{6qi (v)) = 2 for 
all V G Vi^q implies x{5ciq{v)) = 2 for all v G Vi^q\{s,t} and x{5Giq{v)) = 1 for all v G {s,t}. 
Now consider an arbitrary set C S C Vi^q with |5 n {s,t}| / 1, and assume without loss of 
generality that \S n {s,t}| = 0. Then x'{6g' (S)) > 2 implies x{6Gt (S)) > 2 since none of the 
new edges is in 5g' (S). Now consider an arbitrary set C S C V^g with \S n {s,t}| = 1. Then 
x'{5q' (S)) > 2 implies x{6gi (S)) > 1 since only one new edge is in 6q' (S). Thus we have shown 
that x' G ST{G'f^ ) implies x G SP{Gi^q,s,t). 

Now suppose r > and the lemma holds for r — 1. Assume x' G N-^-{N''^^ {ST{G'^ ))) and 
thus it has a protection matrix X' (which has a 0th row and a 0th column, and the remaining 
rows and columns are indexed by Ei). Obtain a matrix X by deleting the rows and columns 
corresponding to the new edges in Ei. We claim that X is a protection matrix witnessing that 
X G N-^-{N^ {SP{Gi^q,s,t))). Note that X is symmetric and positive semidefinite since it is 
a principal submatrix of the symmetric positive semidefinite matrix X' . We also have Xq = 
diag(X) = (1 x). We just need to verify that Xe* G cone{N!^~^{SP{Ge,q,s,t))) and Xq - Xe* G 
cone{N'!^~ {SP{G£^q, s, t))) for each e* G E£^q. We just consider Xe* ; the case of Xq — Xe* is similar. 
We know that X'^l G cone{Nl-^{ST{G'^J)). Thus if X^. q = then X^. is all O's, so Xe* is ah 
O's and we have Xe* G cone(A''^~ {SP{Gi^q,s,t))). Otherwise X'^* ^ > and we have x"''^ ' G 
Nl~^{ST{G'^^^)) where x'('=*) is X^./X^* q with the 0th entry omitted. For each new edge e G E'^,^^, 
since Xq g = Xg = 1 we must have Xg, g = Xg, g, by a basic property of protection matrices. Thus 

Xe = 1 for each new edge e, and the induction hypothesis tells us that x^^*^ G N^~ {SP{Ge^q, s, t)) 
where x*^*^ •* is x'^^ ' with the entries for the new edges omitted, or in other words, x*-*^ ' is Xe*/Xg*^o 
with the 0th entry omitted. Thus we have Xg* G cone(X^~ {SP{Ge^q,s,t))) as desired. D 

Now fix some integers r > and i > 1 and let <; = r and n = |V^,r| = 2{£ + 1) + 2(3r + 3) + 2. 
We define the edge distances d = {de)e&E on the complete undirected graph Kn = iVn^riE) as the 
shortest path distances in the unweighted graph Gi^r- Note that d is nonnegative and satisfies the 
triangle inequality, so this is a valid instance (together with the distinguished nodes s,t). 
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Figure 7: The graph ^4,1. 
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Figure 8: The graph G'^ j^. 




Lemma 4. We have niin^g_5p 



'"'{_ft:„,s,t) 



d-x>U-2. 



Lemma 5. We have min^g^yr {^sP(Kn,s,t)) d ■ x <2l + Qr + 9. 

Combining Lemma [H and Lemma El we find that the ratio on the left side of ([2D is at least 
2^6^+9 1 which is at least 3/2 — e provided i. is large enough and r < 7'^ for some small enough 
7' > depending on e. The latter is implied by r = 772 for some small enough 7 > depending on 
7'. It remains to prove Lemma [Hand Lemma [5l 

Proof of Lemma [7} This is equivalent to lower-bounding the cost of a (non-simple) s-t path in G^^r 
that visits each node at least once. Given such a path, first note that at most one of the edges 
on the top and bottom paths in G^.r is not traversed (otherwise some node would not be visited). 
We claim that for either the top path or the bottom path, all but at most one of the edges are 
traversed at least twice. Suppose not; then there exist two top edges and two bottom edges, all 
four of which are traversed at most once. By symmetry, assume the first of these four edges to be 
traversed is on top; then the s-t path cannot return to the left clique without first going to the 
right clique. Once in the right clique, the path must traverse at least one of the two special bottom 
edges, but then there is no way to get back to the right clique to end at t. This proves the claim. 
These observations imply that the contribution of the top and bottom paths to the cost of the s-t 
path is at least 3£ — 2. D 

Proof of Lemma\^ Consider the vector x = (xe)e£Eir defined as follows. The edges e on the two 

All remaining 



2-l/(r+l) 



paths all have Xe = I. The edges e within the two cliques all have Xe — 3^ ,2 

edges e have Xe = l/(3r -|- 3) (that is, those edges that go between a clique and an endpoint of one 

of the paths, or are incident to s 01 1). Let x' = (Xg)eg_B' be the same as x but with all new edges 



^e.r 



e in E'^^ having x'^ = 1. Cheung jCheOSj proves that x' £ N^{ST{G'^^)); by Lemma[3]this implies 
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X € N^(SP{G£^r,s,t)). Then a simple argument shows that extending x with values for all the 
edges in E\Ei^r yields a point x G N^(SP{Kn,s,t)) such that d- x = J2e€E ^e (using the fact 
that every edge in Gi^r is the shortest path between its endpoints). We have 

EeeE,,.^e = 2M + 2(%+3).^Z5±i)+6(3r + 3).3^ = 2£ + 6r + 9 
so x witnesses Lemma [5l D 

4 Open Problems 

There are many open problems on approximability and integrality gaps for variants of the traveling 
salesperson problem; we give a small sample. 

It is open to prove a nontrivial lower bound on the integrality gap of the standard relaxation for 
the symmetric TSP tour problem after two rounds of the Sherali- Adams procedure. A key technique 
used in most previous work on Sherali-Adams integrality gap lower bounds is the technique of 
constructing "locally consistent distributions" (introduced in [ FdlVKM071 ICMM09] ) . However, 
this technique only seems to apply to relaxations with local constraints, whereas the cut constraints 
for TSP are very global. 

Regarding Theorem [H it would be interesting to overcome any of the four deficiencies described 
in Section II. 2i 

Finally, can the lift-and-project procedures of Lovasz-Schrijver or Sherali-Adams be employed 
to get improved algorithms for any variant of the traveling salesperson problem? 
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